Improvements to Supercomputing Service Availability Based on Data Analysis

Authors

Abstract

As the demand for high-performance computing (HPC) resources has increased in the field of computational science, an inevitable consideration is the service availability of large cluster systems such as supercomputers. In particular, the factor that most affects the availability of supercomputing services is the job scheduler utilized for allocating resources. An analysis of the job data submitted by users showed that 25.6% of jobs failed because of program errors or I/O errors. Based on this analysis, we propose a K-hook method for job scheduling to increase the success rate of job submissions and improve the availability of supercomputing services. By applying this method, the job-submission success rate was improved by 15% without negatively affecting users' waiting time. We also achieved a mean time between interrupts (MTBI) of 24.3 days and maintained average system availability at 97%. This research was verified on the Nurion supercomputer in a real service environment, and its value is expected to be found in significant service improvements.
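
The three figures quoted in the abstract (job failure rate, MTBI, and system availability) are simple ratios. The sketch below shows one common way to compute them. It is not the authors' code and does not implement the K-hook method; the `Job` record, the function names, and the sample numbers are all hypothetical, and the definitions used (MTBI as observation period divided by number of interrupts, availability as uptime divided by total time) are assumptions rather than the paper's stated formulas.

```python
from dataclasses import dataclass


@dataclass
class Job:
    """Hypothetical job record: an identifier and whether the job succeeded."""
    job_id: str
    succeeded: bool


def failure_rate(jobs: list[Job]) -> float:
    """Fraction of submitted jobs that failed (0.0 for an empty list)."""
    if not jobs:
        return 0.0
    failed = sum(1 for job in jobs if not job.succeeded)
    return failed / len(jobs)


def mtbi_days(period_days: float, interrupts: int) -> float:
    """Mean time between interrupts, assumed here to be period / interrupt count."""
    if interrupts == 0:
        return float("inf")  # no interrupts observed in the period
    return period_days / interrupts


def availability(period_hours: float, downtime_hours: float) -> float:
    """Fraction of the period during which the system was up."""
    if period_hours <= 0:
        raise ValueError("period_hours must be positive")
    return (period_hours - downtime_hours) / period_hours


# Illustrative values only; these are not data from the paper.
sample_jobs = [
    Job("a", True),
    Job("b", False),
    Job("c", True),
    Job("d", True),
]
sample_failure_rate = failure_rate(sample_jobs)
sample_mtbi = mtbi_days(period_days=90, interrupts=3)
sample_availability = availability(period_hours=720, downtime_hours=36)
```

With the made-up inputs above, one failed job out of four gives a failure rate of 0.25, and three interrupts in a 90-day window give an MTBI of 30 days.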

Similar resources

Fuzzy Threat Assessment on Service Availability with Data Fusion Approach

Service availability is important for any organization. This has become more important with the increase of DoS attacks. It is therefore essential to assess the threat on service availability. We have proposed a new model for threat assessment on service availability with a data fusion approach. We have selected three more important criteria for evaluating the threat on service availability and...

An Efficient Data Replication Strategy in Large-Scale Data Grid Environments Based on Availability and Popularity

Data grid technology, which uses the scale of the Internet to overcome storage limitations for huge amounts of data, has become one of the hot research topics. Recently, data replication strategies have been widely employed in distributed environments to copy frequently accessed data to suitable sites. The primary purposes are shortening the distance of file transmission and achieving files from ...

A New Job Scheduling in Data Grid Environment Based on Data and Computational Resource Availability

A Data Grid is an infrastructure that manages a huge number of data files and provides intensive computational resources across geographically distributed collaborations. The heterogeneity and geographic dispersion of grid resources and applications pose some complex problems such as job scheduling. Most existing scheduling algorithms in Grids only focus on one kind of Grid jobs which can be data...

Near Term Improvements to WAAS Availability

Since 2003, when it was first declared operational, the Wide Area Augmentation System (WAAS) has been increasing its availability through successive software updates (for example, new monitoring algorithms) and hardware updates (expanded reference receiver network). Today, WAAS provides vertical guidance to more than 3000 runways in the United States. With the current GPS constellation, WAAS ha...

Improvements of Aircraft Availability

Improvements in aircraft availability allow better assignment of aircraft to specific missions. Aircraft Availability is important for reaching organisational goals like 'air power' and mission availability. The performance indicator Aircraft Availability gives insight into the contribution to the mission capability of the fleet. Depending on the goal, there are different ways to calculate Aircr...

Journal

Journal title: Applied Sciences

Year: 2021

ISSN: 2076-3417

DOI: https://doi.org/10.3390/app11136166